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ABSTRACT The coronavirus responsible for the severe acute respiratory syndrome contains a small envelope protein, E, with 
putative involvement in host apoptosis and virus morphogenesis. To perform these functions, it has been suggested that protein 
E can form a membrane destabilizing transmembrane (TM) hairpin, or homooligomerize to form a TM pore. Indeed, in a recent 
study we reported that the a-helical putative transmembrane domain of E protein (ETM) forms several SDS-resistant TM 
interactions: a dimer, a trimer, and two pentameric forms. Further, these interactions were found to be evolutionarily conserved. 
Herein, we have studied multiple isotopically labeled ETM peptides reconstituted in model lipid bilayers, using the orientational 
parameters derived from infrared dichroic data. We show that the topology of ETM is consistent with a regular TM a-helix. 
Further, the orientational parameters obtained unequivocally correspond to a homopentameric model, by comparison with 
previous predictions. We have independently confirmed that the full polypeptide of E protein can also aggregate as pentamers 
after expression in Escherichia coli. This interaction must be stabilized, at least partially, at the TM domain. The model we report 
for this pentameric a-helical bundle may explain some of the permabilizing properties of protein E, and should be the basis of 


mutagenesis efforts in future functional studies. 


INTRODUCTION 


The causative agent of the severe acute respiratory syndrome 
(SARS) (1) is a member of the family Coronaviridae, which 
cause most common colds in humans and are responsible for 
serious diseases in other animal species. The lipid bilayer 
that surrounds these viruses typically embeds three proteins: 
spike (S), membrane (M), and envelope (E) protein. These pro- 
teins are studied for their important role in receptor binding 
and virion budding. 

The E protein of the SARS-CoV virus is a small, 76 amino 
acid, integral membrane protein with one putative trans- 
membrane a-helical (TMH) hydrophobic domain, 20-30 
amino acids long, flanked by a short N-terminus (<10 amino 
acids) and a longer C-terminus tail, both more hydrophilic. 
This protein is very important for virus morphogenesis, to the 
point that it has been found to be critical for viral budding in 
mouse hepatitis virus (MHV) (2). Also, in many coronavi- 
ruses, coexpression of M and E proteins, which probably 
interact via their cytoplasmic domains in pre-Golgi com- 
partments (3), is necessary for morphogenesis (4,5). 

A further role suggested for protein E is to promote apo- 
ptosis (6,7). It has also been found that SARS-CoV E protein 
can alter membrane permeability when expressed in Esch- 
erichia coli and mammalian cells (8) and in vitro cation- 
selective ion channel formation has been reported using a 
totally synthetic polypeptide (9). This apparent ability to 
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form channels suggests that E protein is able to form TM 
homooligomers, as reported in other similar systems (10,11). 

To determine if, and which, TM oligomers are present 
in SARS-CoV E, we used recently (12) a computational 
method that filters nonnative interactions with the help of 
evolutionary conservation data (13). A search of the possible 
interhelical interactions in the transmembrane domain of E 
protein (ETM) was performed using a total of 17 homologs 
of SARS-CoV E, present in other viruses. We showed that 
only three homooligomeric models had been conserved by 
evolution of the viruses: a dimer, a trimer, and two similar 
pentamers. Due to the special nature of our simulation, 
totally independent from experimental measurements, and 
only dependent on the disturbing effect of conservative 
mutations on non native models, we predicted (12) that all 
these oligomers should be found in vivo. 

This conclusion was consistent with previous data for 
expression of whole protein E in E. coli and mammalian 
cells, where dimers and trimers in SDS were observed, al- 
though no pentamers could be detected (8). We also reported 
in the same study (12) that these three types of oligomers of 
synthetic ETM were stable in SDS, which suggested a 
stabilizing function at the TM domain. Clearly, of these 
oligomeric forms, only a pentamer is consistent with the for- 
mation of a pore that could account for the ion channel 
behavior observed in previous reports (9), but at present the 
oligomeric form of protein E, and ETM, in lipid bilayers is 
not yet known. 

Herein, we have attacked this problem examining the 
structure of isotopically labeled ETM when incorporated in 
model lipid bilayers using polarized attenuated total reflec- 
tion Fourier Transform infrared spectroscopy (PATIR-FTIR). 
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Pentameric Form of Protein E 


Specifically, we have synthesized ETM introducing a labeled 
carbonyl ('5C='8O) (14) at various positions (15), for a total 
of nine residues along the ETM sequence. We have then 
determined the orientation in space of these labeled peptidic 
carbonyl groups using the theory of site specific infrared 
dichroism (SSID) (16). Like any orientational technique, 
SSID has certain limitations. First, it requires a certain order 
of the sample, and therefore is not applicable to proteins in 
solution. Also, it requires a known periodicity in the sample, 
which is found in a-helical segments. Finally, the require- 
ment for discrimination between neighboring peptide car- 
bonyl orientations makes it unsuitable when a-helices are 
either completely parallel or perpendicular to the membrane 
normal. SSID is therefore ideal to study TMH bundles, 
where all the above conditions are met (14,15,17). 

We note that in several reports (14,16,18—20), we have 
used only two labeled residues, at consecutive positions, to 
give support to a particular orientation. This is appropriate 
when alternative models are separated by w angles of at least 
45°, because the error in w using SSID is usually 10—20°. A 
more thorough validation of a particular model, however, 
requires a multiple labeling approach so that experimental 
values and predicted model can be compared at multiple 
sites, as reported previously for CD3¢ (15). 

As in previous work (15), we have compared experimental 
versus predicted tilt and rotational orientation of the peptide. 
For the predicted values, we used those obtained from the 
evolutionarily conserved models during molecular dynamics 
simulations (12). Additionally, we have again addressed the 
problem of the oligomerization of E protein, 1.e., the com- 
plete polypeptide, when expressed in FE. coli. Our results 
provide an important insight into the mode of organization of 
E protein in lipid bilayers. 


EXPERIMENTAL PROCEDURES 
Expression of SARS-CoV E protein in E.coli 


Plasmids of full length C-terminal Histidine tag fused SARS-CoV E protein 
cloned in a pET 24a vector previously described (8) were transformed into 
E. coli strain BL21DE3 pLysS. A single colony was grown in Luria Broth 
medium until the absorbance reached 0.4 at 600 nm. The culture was 
induced with 0.5 mM IPTG, and a part of the culture was transferred to a 
fresh culture vial as uninduced fraction. The induced and uninduced cultures 
were further incubated at 25°C for 6 hr. 


Western blot analysis 


Total cell lysate from the uninduced and induced fractions were prepared by 
sonication, and the slurry obtained was mixed with 2* SDS loading buffer 
(with 150 mM DTT and 2% SDS) and subjected to 15% SDS-PAGE. 
Proteins were transferred to nitrocellulose membrane (Pierce, Rockford, IL) 
and blocked overnight at 4°C in blocking solution (5% fat free milk solution 
in PBS pH 7.4). The membrane was incubated with a 1:1000 diluted primary 
antibody (anti-His tag) in blocking solution for 1 h at room temperature. 
After three washings with PBST, the membrane was incubated with 1:2000 
diluted secondary antibody (anti rabbit HRP conjugate DAKO) in blocking 
solution for 1 h at room temperature. After washing for three times with 
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PBST, the proteins were detected with a chemiluminescence detection 
kit (West Pico, Pierce) according to the manufacturer protocol. 


Preparation of labels and peptides 


Amino acids labeled with carbonyl '*C='8O were obtained as described 
previously (14,21). The amino acids were then derivatized with 9-fluorenyl- 
methyloxycarbonyl (FMOC) (22). 

The peptides corresponding to ETM (Fig. 1 A) were synthesized using 
standard solid phase FMOC chemistry (Intavis ResPep (Gladbach, Germany) 
peptide synthesizer), from residue 9 to 35, adding two lysine (K) residues to 
both N- and C-termini (Fig. 1 B) to increase their solubility (12). The 
peptides were cleaved from the resin with trifluoroacetic acid (TFA) and 
lyophilized. The lyophilized peptides were purified by HPLC and analyzed 
by SDS-PAGE as described previously (12). Lyophilization was performed 
always in the presence of HCl (typically at an approximate molar ratio 20:1, 
HCl/peptide) to avoid the formation of peptide-TFA adducts; consequently, 
the typical TFA band at ~1685 cm ' was not observed in the amide I 
region. Peptide purity was further confirmed by mass spectrometry. During 
the synthesis of ETM, an amino acid bearing a '*C='8O carbonyl group was 
introduced at a specific position (Fig. 1 B, open circles). A mutation was 
introduced in two of the peptides, NISA and F23A (Fig. 1 B, residues 
underlined). In the first case (NI5A), this was done to observe if Asn 
substitution had a critical destabilizing effect on the oligomers, because Asn- 
Asn interactions were predicted to be present in all evolutionarily conserved 
oligomeric models (12). The second mutation, F23A, was assumed to be 
critical in view of its suggested central role in the formation of a putative 
palindromic helical hairpin (23). 
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FIGURE 1 (A) Position of the putative TMH domain (TM, see box) in the 
complete sequence of SARS-CoV E. The residue numbers and N- and 
C-termini are indicated. (B) Peptides synthesized corresponding to ETM, 
labeled at various positions with '*C='%O (O) and cyano-phenylalanine 
(@); two mutated sequences are indicated with a star (right), and the mutated 
residues are underlined. (C) Synthetic peptides used as a control, corre- 
sponding to the TMH domain of CD3-¢, (top), integrin B2 (middle), and 
integrin aM (bottom). (D) Schematic representation of an a-helix with 
parameters w (range 0—359°) and B (range (0O—90°). The peptidic C=O bonds 
isotopically labeled are shown as spheres (C, solid; O, open). The arrows 
indicate the direction of the C=O bond relative to the z axis. This direction 
is obtained from the helix and label dichroic ratios (see Experimental 
Procedures) and is used to determine w. 
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As a control, we used peptides that are a-helical in lipid bilayers, with 
their C- and N-terminus on opposite sides of the membrane, ~25 residues 
buried in the membrane, and which are not known to disrupt liposome in- 
tegrity. These TM peptides, of integrin 62 (82-TM), integrin aM (aM-TM) 
and the component of the T-cell receptor, CD3¢ (CD3 ¢-TM), were 
synthesized without any labels (Fig. 1 C) and purified as described above. 


Cyano-phenylalanine labeling 


The nitrile-derivatized amino acid, 4-cyano-phenylalanine (Phe-CN, Sigma- 
Aldrich, St. Louis, MO) was introduced during the FMOC synthesis in three 
separate peptides at positions F20, F23, or F26 (Fig. 1 B, solid circles). 


Preparation of liposomes 


Unilamellar liposomes of predefined size were obtained as described 
previously (24). Briefly, multilamellar liposomes were obtained by mixing 
lipid and water and were extruded 30 times through a 200-nm pore poly- 
carbonate filter using a mini-extruder (Avanti Polar Lipids, Alabaster, AL). 
DMPC (1,2-dimyristoyl-sn-glycero-3-phosphocholine, Avanti Polar Lipids) 
liposomes containing ETM, B2-TM, aM-TM, or CD3-¢-TM were prepared 
by adding the peptide to the lipid (15:1 DMPC/peptide molar ratio) prior 
extrusion. Typically, | mg of dry peptide was mixed with DMPC and 1 ml of 
HFIP (1,1,1,3,3,3-hexafluoro-2-propanol, Merck, Darmstadt, Germany). The 
solvent was evaporated with a rotary evaporator, before addition of 1 ml of 
H,O. Homogeneity in the diameter of the extruded liposomes was assessed 
using Dynamic Light Scattering (DLS), which was performed using a 90 Plus 
particle size analyzer (Brookhaven Instruments, Holtsville, NY). 


Data collection and sample preparation 
for ATR-FTIR 


Infrared spectra were recorded on a Nicolet Nexus spectrometer (Madison, 
WI) purged with N»> and equipped with a MCT/A detector, cooled with 
liquid nitrogen. Attenuated total reflection (ATR) spectra were measured 
with a 25-reflections ATR accessory from Graseby Specac (Kent, UK) anda 
wire grid polarizer (0.25 zm, Graseby Specac). A total of 200 interferograms 
collected at a resolution of 4 cm! were averaged for every sample and 
processed with 1 point zero filling and Happ-Genzel apodization. The pep- 
tide (typically ~3 mg) was reconstituted in a 15:1 DMPC/peptide molar 
ratio (14) and applied onto a trapezoidal (50 mm X 2 mm X 20 mm) Ge 
internal reflection element (IRE). A dry, or D2O saturated, N> stream flowing 
through the ATR compartment (14) was used to remove bulk water or to 
achieve D,O exchange, respectively. 


H/D exchange 


The percentage of isotopic (hydrogen/deuterium, H/D) exchange was 
calculated from the area ratio Amide II /Amide I, using nonpolarized spectra, 
before and after exchange in D,O, as described previously (25). Non 
polarized spectra were obtained from the parallel (||) and perpendicular (1) 
ATR polarized spectra, using the expression | X (||) + 1.44 X (L), as 
described previously (26). 


Analysis of infrared data 


The data was analyzed according to the theory of site-specific dichroism 
(SSID) presented in detail elsewhere (16). SSID uses the fact that the 
measured dichroic ratio of a particular transition dipole moment, e.g., 
peptidic C=O stretching, is a function of its spatial orientation, defined by 
tilt 6 and rotational pitch angle w (see Fig. 1 D), which is defined arbitrarily 
as 0° when the transition dipole moment, the helix director, and the z axis all 
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reside in a single plane (16). The angle a between the transition dipole 
moment of the vibrational transition and the helix axis is known from fiber 
diffraction studies (27), and was taken as 39° for the peptidic C=O bond and 
29° for the N-H bond (28). The values for the electric field components, é,, éy, 
and ¢,, were those calculated according to a thick film approximation (29). 

Dichroic ratios were calculated as the ratio between the integrated 
absorptions of the spectra collected with parallel and perpendicular polarized 
light. The area of the amide A was calculated by integration of the band 
centered at ~3300 cm‘, between 3200 and 3400 cm~! (14) when the 
sample was equilibrated in D,O. In these conditions, this band corresponds 
only to the membrane-embedded part of ETM. The absorption correspond- 
ing to the label, i.e., the Cc=180 carbonyl stretching vibration, was ob- 
tained integrating the band centered at ~1590 cm! (15,16). Thus, from 
each measurement, two different dichroic ratios, from amide A (Ryejx) and 
label (Ryite), Were obtained (Eqs. 1 and 2, respectively) (16). 

The first (Ryelix, Eq. 1) is the composite dichroism that corresponds to all 
membrane-embedded amide N-H bonds in the helical structure for sample 7. 
This dichroism is independent of w and dependent solely on 6 and the 
fractional order of the preparation, f;. This fractional order is 1 if the sample 
is completely ordered, and O if randomly oriented (16). The parameters 
Kx. y, or z are the rotationally averaged, integrated absorption coefficients, 
whereas Ky y, or z (@) refers to the absorption coefficient at a specific w angle. 


e° (fK, + 4) + et (AK, + +4) 
2 235 
ev (fiky v A) 


_ &(fK,(w) + 8) + e(fiKx(@) + 5") 
Rites; (B.fi, w) =. e, (f,K, (w) ue Lt) 


The other dichroic ratio (Rsite, Eq. 2) corresponds to the isotopic label, 
centered at ~1590 cm’, which depends on three parameters: @ site, B, and fi. 

These two equations are not sufficient to obtain B, @,i., and f, (three 
unknowns), therefore a second label is inserted in a different sample /, either 
immediately before or after the first label (16). There are 3.6 residues per 
turn in a canonical a-helix, therefore Aw between consecutive residues is 
assumed to be 100°. Thus, with the additional labeled sample, two additional 
equations can be obtained, Ryeiix; and Rsitej, dependent on (8, f;) and (6, @ + 
100°, fj), respectively. Solving simultaneously the four equations Ryerxi, 
Ryetixj> Rsitei, aNd Rgite; for each i and j pair, yields B;;, @;;, f,, and f;,, where Bj, 
and w,; are the results obtained from the combinations of sample 7 and 
sample j. These nonlinear equations were solved with Newton’s method as 
implemented in the FindRoot function in Mathematica 5.0 (Wolfram 
Research, Champaign, IL). The final values of 6 and m for a particular pair 
of labels (i, 7) were obtained by averaging (at least three measurements) 
Bi; and wj;, respectively. 

A total of 27 sets of dichroic ratios (Rhejix, Rsite) Were Obtained (for each 
of the 9 labeled peptides in Fig. | B, three independent preparations were 
measured). As described above, these values were combined in consecutive 
labels whenever possible, i.e., (i, i + 1) so that the error in the assumption 
Aw = 100° was a small as possible. In other cases combinations (i, i + 2) 
or (i, i + 3) using Aw of 200° or 300°, respectively was also tested. 
Specifically, the combinations used were V17/L18, L18/L19, L19/L21, 
L21/A22, A22/A23, A23/V24, V24/L27, and L27/L28. These combinations 
were used to obtain helix tilt, 8, and rotational orientation w at each i-labeled 
residue, as well as the contribution of disorder (f) in the samples (15), which 
was usually 0.7—0.9, depending on the sample. 


Ryuetix, (B,fi) = (1) 


Visualization of orientational data 


To visualize the allowed values (w, 6) of a specific labeled residue for a 
particular measurement (Ryeix, Rsite), We used the reasonable assumption 
that the disorder, f, of a particular preparation, is the same for the a-helix and 
for its labeled site. Thus, we first plotted the allowed values of fas a function 
of w and B for the measured Rye. Second, we plotted a similar graph (fas a 
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function of w and B) for the measured R},.);x. AS fshould be the same for both 
helix and site, because they belong to the same sample, we obtained the 
intersection values of these two surfaces and projected them in the (w,6) 
plane. Thus, each pair of dichroic ratios obtained in a particular preparation 
(Rnetixs Rsite) 18 represented by a curve with all the allowed values of w and 8 
for that particular labeled residue. 

It can be shown that when Rhetix < Rsite aNd Rnetix > Reite, this projection 
will be centered at O° and 180°, respectively; for example, the curves for the 
samples with label at V17 and L18. In both cases Rpejix < Rsite (Table 1) 
therefore, the ‘‘smiles’’ for the two residues should be centered initially at 
0°, but because Aw is 100°, we shift the ‘‘smiles’’ for L18, 100° to the left. 
The coordinates w,7 and 6 (for residues 17 and 18) can be found at the 
intersection of the two curves. The same strategy was followed for other 
combinations of labels, shifting the ‘‘smiles’’ of the second label to the left, 
according to Aw, and finding the intersection of the curves. 

The predicted values for (w,8) were obtained from the corresponding 
conserved model in the molecular dynamics simulations reported previously 
(12), calculating the values (w,6) for each monomer, and averaging for all 
the monomers in the oligomer. 


Pore calculations 


The dimensions of the pore for the pentameric model consistent with the 
experimental data were calculated using the software HOLE (30). 


TABLE 1_ Dichroic ratios 

Residue Rhelix R site 
V17 3.6 4.6 
V17 4.1 4.3 
V17 3.6 3.9 
L18 32 3.5 
L18 3.3 3.4 
L18 2.9 3.0 
L19 3.6 3.5 
L19 3.2 2.8 
L19 3.5 3.3 
L21 3.5 6.0 
L21 3.6 6.6 
L21 4.1 6.2 
A22 3.6 4.3 
A22 3.2 4.4 
A22 3.4 4.1 
A23 3.4 2.0 
A23 3.1 2.3 
A23 4.0 253 
V24 4.7 Dat 
V24 4.0 4.3 
V24 3.8 4.1 
L27 4.1 5.0 
L27 4.0 4.6 
L27 3.6 4.1 
L28 4.2 4.4 
L28 4.0 5.1 


L28 3.8 4.1 


Dichroic ratios measured by SSID using the amide A and the isotopic label 
for ETM incorporated in DMPC lipid bilayers (see Experimental Proce- 
dures). The information from each labeled residue was obtained from at 
least three independent measurements. 
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RESULTS 
Electrophoresis 


Electrophoresis performed with all the peptides shown in 
Fig. | B were identical as those obtained previously (12), 
showing bands consistent with dimers, trimers, and penta- 
mers (see Fig. 2). These oligomers were present even in the 
case of the a priori critical mutants NI5A and F23A (Fig. 1 B, 
underlined). These two mutants were synthesized because 
they could affect oligomerization via two possible mecha- 
nisms: N15 might provide stabilization for the TM oligomers 
via interchain hydrogen bonds (31,32), whereas F23 has 
been suggested to have a central role in the formation of a 
putative short hairpin of ETM (23). Intriguingly, we did not 
find a change in oligomerization pattern or electrophoretic 
mobility in any of these mutants, indicating that these two 
residues, N15 and F23, are not essential for ETM oligomer- 
ization in the presence of SDS. 


Liposome morphology 


Previous reports (23) have suggested that ETM inserts into 
lipid bilayers as a short hairpin and it may induce dramatic 
morphological changes in lipid bilayers. 

To test this, we used dynamic light scattering (DLS) to 
monitor possible morphological changes of DMPC liposomes 
incorporating ETM, as compared to a), liposomes without 
peptide or 5), liposomes containing a typical TMH peptide, 
e.g., integrin B2-TM (see Fig. 1 C). In all three cases, the 
calculated diameter of the liposomes was ~170 nm (168.5 = 
1 nm, 168 + 2 nm, and 170.5 = 1.5 nm, respectively), which 
indicates no dramatic disturbing effect of ETM on liposome 
morphology. 


ETM hydrogen/deuterium exchange 


Representative infrared spectra of isotopically labeled ETM 
in DMPC lipid bilayers are shown in Fig. 3. As reported 
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| ea 3 (9) diver 


FIGURE 2 SDS-PAGE electrophoresis corresponding to the synthetic 
TM peptide of SARS protein E. Lane 1 shows the molecular weight markers. 
Lanes 2-4: peptide containing mutation N15A (see Fig. 1 B) with increasing 
load of peptide: 10, 20, and 40 wg, respectively. Arrows indicate the bands 
corresponding to the dimer, trimer, and pentameric forms of the peptide. The 
bands corresponding to the pentamer in lanes 2 and 3 were visible only after 
silver staining. Electrophoresis results for the other peptides shown in Fig. 1 B 
were identical, and are not shown. 
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previously (23), the shape of the amide I absorption band, 
with a maximum at 1657 cm’, is indicative of a predom- 
inant a-helical structure (33). Examination of the amide I and 
II bands in Fig. 3 A indicates that the percentage of hydrogen/ 
deutertum (H/D) exchange of ETM amide groups was small 
(17 + 2%), which indicates that ~26 residues are membrane- 
embedded. A similar number of exchange-protected residues 
(25—26) was obtained with the peptides control (shown in 
Fig. 1 C, not shown). These results suggest that ETM and the 
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FIGURE 3 __ Infrared spectra of ETM incorporated in DMPC lipid bilayers. 
(A) Nonpolarized infrared spectra of the amide I and II region in H,O, either 
at low hydration (solid line) or hydrated in DO (dashed line). The small 
reduction in amide II intensity is due to H/D exchange. (B) Amide I region 
and isotopic '5C='80 label (residue A22), centered at 1590 cm! (see inset), 
obtained at 0° (solid line) or 90° (dashed line) polarization. (C) Same as in B, 
but corresponding to the amide A of the spectrum. 
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TMH peptides shown in Fig. | C share the same topology, 
1e., that of a regular, slightly tilted, TMH. 


ETM backbone orientation 


After confirming that ETM is inserted, and a-helical, in lipid 
bilayers, we investigated the tilt and rotational orientation 
of the peptide, comparing dichroic experimental data and 
predicted orientation (see Experimental Procedures) from the 
computational models (34). An example of the infrared 
dichroic data obtained at two polarizations (see Experimental 
Procedures) is shown for the peptide labeled at A22, for 
amide I and site (Fig. 3 B) and for amide A (Fig. 3 C). 
Spectra for other labeled peptides were similar, and are not 
shown. 

The dichroic ratios obtained from each measurement, 
Rhetix aNd Rite are Shown in Table 1. With each pair of values 
(Rnetix» site), We Can visualize the allowed w- and B-values 
for that particular site, as shown in the ‘“‘smile’’ plots in 
Fig. 4, D—K (see Experimental Procedures). The *‘smiles’’ 
for different residues were found to intersect (1.e., converged 
to a solution) for the pairs 17/18 (Aw = 100°), 18/19 (Aw = 
100°), 19/21 (Aw = 200°), 21/22 (Aw = 100°), 22/23 (Aw = 
100°), 23/24 (Aw = 100°) and 27/28 (Aw = 100°). More 
importantly, Table 2 shows that using these Aw, expected 
for a canonical a-helix, the w-values are consistent with 
the pentamer B oligomer obtained previously using global 
search molecular dynamics (12), except for 27 and az, 
which are closer to pentamer A than to pentamer B. The Aw 
between 24 and 27 is therefore 390° (Fig. 4 J), not 300°. 
Indeed, when we combined 24 and 27 assuming Aw = 300° 
(not shown) w24 was 40°, which is not consistent with the 
value @24 = —70°, obtained with the pair 23/24. The reason 
for the inconsistency may be a slight bend from residues 
24 to 27, and exemplifies the need to use consecutive residues 
in these calculations. Slices through a pentamer in agreement 
with these observations are represented (Fig. 4, L—Q). 

We point out that a solution was found consistent with 
the expected helix periodicity and rotational orientation of 
pentamer B (see Table 2) when using the pairs 18/19 or 
19/21, even with the mutated peptide NI5A (isotopically 
labeled at L19). This suggests that this mutation not only 
does not perturb the oligomeric forms present in SDS (see 
Fig. 2) but also does not alter significantly the structure of the 
oligomer in lipid bilayers. Similarly, our results show that 
the substitution F23A did not affect the orientation of the 
a-helices: when the dichroic data from the label introduced 
at position A23 (Fig. 1 B, asterisks) was combined with V24 
or A22, a solution was found which was consistent with the 
w-values expected for the pentameric B model. Together 
with the electrophoresis results, which show that none of the 
oligomeric forms are affected by mutation F23A (Fig. 2), this 
suggests that F23 does not play any essential role in either 
maintaining the oligomeric structure or in defining a par- 
ticular topology. 


Pentameric Form of Protein E 


943 


Gh 


F23 


F 5 
i av 18D sic 


V14 VI7 L21 £20 


Re 
TAB 


60 


A22 


L286 


ai 
5 


Bk 


FIGURE 4 _ Visualization of the dichroic data obtained and proposed model for the ETM pentamer: (A) plot corresponding to the order parameter, f, of a 
labeled residue as a function of w and B, assuming a Ryi- of 4.2; (B) the same plot for Ryeix, When Rpeix 18 3.5; (C) superposition of the two surfaces in A and B, 
where the intersection corresponds to (w,6) coordinates with the same disorder f ; (D—K) coordinates (w,6) at the intersection shown in C for pairs V17/L18 
(D), L18/L19 (E), L19/L21 (F), L21/A22 (G), A22/A23 (A), A23/V24 (1), V24/L27 (J), and L27/L28 (K); (L—Q), slices for the model in agreement with our 
dichroic data: 11-13 (L), 14-18 (MW), 19-22 (N), 23-26 (O), 27-30 (P) and 31—34 (Q). The residue numbers are indicated. Color code: L, green; V, cyan; I, 


salmon; A, marine; F, blue; N, orange; and S and T, red. 


Phe-CN labeling 


If the topology of ETM is a regular TMH, the side chains of 
the 3 Phe residues (F20, F23, and F26) must be at or near 
the center of the lipid bilayer. To confirm this topology, we 
labeled ETM with Phe-CN, a nitrile derivatized phenylala- 
nine, at these three positions (see Fig. 1 B, solid dots). The 
rationale for this experiment is provided by the sensitivity of 
the C=N stretching vibration of this nitrile derivatized side 


chain to the hydration and polarity of the environment 
(35,36). If the side chain is fully hydrated, the band is 
centered at 2235 cm! in contrast, when inserted deep in 
phospholipid bilayers, this band shifts to lower wavenum- 
bers, at 2229 cm ', and becomes narrower (35). Fig. 5 shows 
the C=N stretching vibration of the three nitrile derivatized 
amino acids when ETM is reconstituted in DMPC lipid 
bilayers. Clearly, for all three Phe residues, the band is 
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TABLE 2 Experimental versus predicted orientational values 
for ETM incorporated in model lipid bilayers 


Residue Dimer Trimer Pentamer A Pentamer B w (exp) BP (exp) 
V17 61 (12) —60 (36) -—31 (25) -—80(21) -—47 +520 2 
L18 —176 (6) 52 (37) 57 (24) 8 (26) 58 +814+2 
L19 —54 (12) —178 (30) 164(17) 110@2) 105 +3272+1 
E24 99 (9) 9 (39) 6 (27) —30 (26) -44+429+2 
A22 —127 (6) 137 (32) 106 (23) 60 (25) 50 +719 +3 
A23 —3 (13) —113 (32) —128 (34) —183 (18) —170 +7162 1 
V24 40 (14) -—21 39) -—32 (27) —-83 23) -70+419=+1 
L27 —2 (14) —52 (36) —64 (22) —113 20) -—-48 +420+ 2 
L28 66 (11) 66 (36) 22 (26) —15 (26) 52° 


Comparison of the rotational orientation (@) and helix tilt (6, within 
parentheses) obtained computationally (14) for the dimeric, trimeric, and 
pentameric (pentamer A and pentamer B) models with the values for w and 
f obtained herein by SSID (exp, shown in last two columns). 

*Obtained adding 100° to woz. 


centered at ~2230 cm’. This indicates a generally hydro- 
phobic and dehydrated environment, consistent with these 
Phe residues being deeply buried in the lipid bilayer. 


Homopentamers detected after E expression 
in E. coli cells 


To add support to a pentameric form of SARS-CoV E in 
biological membranes, we expressed the full length of pro- 
tein E (shown in Fig. 1 A) in E. coli cells (see Experimental 
Procedures). After electrophoresis of the extract in reducing 
conditions, where disulfide covalent bonds are not possible, 
mainly monomers (~9 kDa) and pentamers (~46 kDa) were 
detected (Fig. 6, Jane 2), and also some dimers (~17 kDa). 
This suggests that not only protein E pentameric oligomer- 
ization may be present in vivo, but also that is mediated, at 
least partly, by TM interactions. 


DISCUSSION 


We have shown that SDS-resistant oligomers of ETM, con- 
sistent with dimers, trimers, and pentamers, can be observed 
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FIGURE 5 Spectral features of the C=N stretching vibration for F20 


(dotted line), F23 (dashed line), and F26 (solid line) cyano-phenylalanine 
when ETM was reconstituted in DMPC lipid bilayers. 
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FIGURE 6 Expression and oligomerization of SARS-CoV E protein. The 
His-tagged E protein expressed in FE. Coli BL21DE3pLysS cells was 
analyzed for expression as uninduced and induced total cell lysate fractions 
(lanes I and 2), respectively, on a 15% SDS polyacrylamide gels in the 
presence of 150 mM DTT and 2% SDS. The proteins were transferred to a 
nitrocellulose membrane and probed with polyclonal anti His tag antibody 
in 1:1000 dilution. This was by followed incubating with horse radish- 
peroxidase conjugated anti rabbit IgG and the E protein was detected by a 
chemiluminescence detection kit. Numbers on the right indicate molecular 
masses (kDa). 


when ETM is synthesized with two flanking lysine residues. 
This modification, which we also used in a previous report 
(12) was made to improve solubility and purification (37). It 
has been demonstrated (38,39) for various TM oligomers, 
e.g., monomeric epidermal growth factor receptor (EGER), 
dimeric glycophorin A (GpA), and tetrameric M2 from In- 
fluenza A, that even with up to four flanking lysines at each 
C- and N-termini, oligomerization is not affected during SDS 
PAGE electrophoresis. Also, electrophoretic mobility has 
been reported to be reduced ~30% when three lysines are 
added at each terminus (38). In this work, with only two 
lysines on each side, the error in molecular weight estimation 
should not be higher than 20%. In support that a pentameric 
form of whole E protein is also present in SDS, we show 
bands consistent with pentamers (~46 kDa), along with 
monomers (~9 kDa) and some dimers (~ 17 kDa) of E pro- 
tein when expressed in EF. coli in reducing conditions. In a 
previous work (8) only monomers (~9 kDa), dimers (~18 
kDa), and trimers (~27 kDa) were observed, but only under 
nonreducing conditions. We are unable to explain these 
variations in the oligomeric species observed, but they may 
be related to the complex functionality of SARS- CoV E. In 
any case, the band at (~46 kDa) cannot be interpreted as 
being trimers, because these were observed previously at 
~27 kDa (8). Incidentally, a pentameric homooligomer has 
also been observed in other similarly sized viral membrane 
proteins, e.g., the small (63 residues) hydrophobic protein in 
the respiratory syncytial virus (40,41). It is possible therefore 
that these two proteins perform similar functions. In this re- 
spect, E protein has been found to induce in vivo per- 
meabilization of biological membranes to bulky molecules 
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such as o-nitrophenyl-6-D-galactopyranoside and hygrom- 
ycin B (8). Although a pentameric porelike structure cannot 
explain permeabilization to these big molecules, it can ex- 
plain the permeability to cations observed in an in vitro 
system (9). 

If model B were correct, the narrowest constriction site 
along the lumen occurs at F26 (see Fig. 7), with a diameter of 
only 1.66 A (the effective diameter of a water molecule is 
~2 A). Clearly, the model pentamer B is not consistent with 
an open channel (the diameter of naked Na™ or Ca** is ~2 A, 
and that of K~ is even larger, 2.66 A). We note that similar 
constriction diameters have been found for other channels in 
a closed state, e.g., 1 A in KirBac (42), a bacterial homolog 
of mammalian inward rectifier K channels. Considering 
the model A orientation from residues 24—27 however, the 
channel appears more open. Our experimental data can only 
be explained by a bend of the a-helices around residues 
25-27, with residues 17—24 consistent with pentamer B and 
24 onward consistent with pentamer A. The only partial 
consistency with the computational models may be ex- 
plained by the fact that during the simulations of ETM (12) 
we restrained the helix tilt to a particular value, and effects 
such as bends or kinks, if existed in the native structure, were 
not accounted for. The result of this simplistic assumption 
may have been that, instead of a single pentameric model, 
two different models were obtained with only a partially 
correct orientation in different parts of the sequence. This 
conclusion is supported by the fact that none of these two 
models contained all the lowest energy structures for the 17 
sequences analyzed, although most of them (16) were in 
agreement with model A and only | (PHEV) with model B. 

We have estimated the ion conductance of this putative 
channel, using the pore radius profile (30). The result, 74 pS, 
is remarkably similar to that reported experimentally (80 pS) 


radius (A) 


FIGURE 7 _ Plot showing the pore radius for the model pentamer B (solid 
line) and A (dashed line) as a function of the z coordinate (normal to the 
bilayer). As the orientation of residues 27 and 28 are more similar to 
pentamer A; we also show the plot as a hybrid between pentamer A and B, as 
a thicker line. 
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using a synthetic polypeptide of SARS-CoV E (9). This 
value is also similar to the 50-70 pS predicted for phos- 
pholamban (43), also suggested to be a cation channel, for 
which conductance was experimentally reported to be 100 
pS (44). 

We note that our results are in general inconsistent with 
previous reports (23,45) that suggest that ETM forms a 
membrane-destabilizing, totally embedded, a-helical trans- 
membrane hairpin. Indeed, we show by DLS that ETM has 
no disturbing effect on lipid bilayers, and does not induce 
tubular morphologies. 

Further, we show by H/D exchange that only 25-26 
residues are inserted in the membrane, and this is inconsis- 
tent with a totally embedded ETM, expected for a hairpin 
model (23,45). 

Also, the hairpin model predicts that F23 side chain will 
be at the tip of the hairpin, and located at, or near, the polar 
headgroups of the lipids (23,45). Our result after labeling the 
F residues of ETM with cyanophenylalanine favor a TM 
a-helical topology of ETM, because the frequency of vibra- 
tion in all F residues is similar and centered around 2230 
cm~!, and not closer to 2235 cm~!, which would be ex- 
pected for a more polar environment. 

Our dichroic data is also consistent with a regular TM 
a-helix, i.e., the solution to the system of equations (see 
Experimental Procedures) (16) for the pairs around F23, 
Le., 22/23, or 23/24, converges to a sensible result when 
assuming a-helical canonical periodicity between consecu- 
tive residues (Aw = 100°). Equally for the pair 22/24, if 
Aw ~ 200°. We also note that if the helix tilt determined 
previously without any isotopic label (23) was =6°, we should 
be unable to calculate w-angles for the different residues, 
because the transition dipole moment of any peptidic C=O 
bond would have a similar orientation relative to the z axis. In 
other words, SSID can only provide orientational informa- 
tion if the tilt is at least 10°. In this work, the helix tilt found 
is in the range 20—30°. We conclude therefore that the 
topology of ETM in our preparation is not the same as the 
preparation used to report a TM hairpin (23,45). 

One possible explanation for the discrepancy may be 
found in the fact that our peptide contains 2 lysines flanking 
ETM at both N- and C-termini, whereas these lysines were 
not present in the latter reports. It is possible that the presence 
of these positively charged lysines favors a transmembrane 
topology, and not a hairpin. In any case, the pentameric 
form we report is unlikely to be an artifact, because these TM 
interactions and oligomeric size are consistent with evolu- 
tionary conservation data and expression of the complete 
SARS-CoV E polypeptide, and also consistent with the reported 
ion channel activity. Clearly, more than one topology is pos- 
sible for ETM, especially considering the range of attributed 
roles for this small protein. 

Finally, although channel-like activity appears to be more 
compatible with a regular a-helix than with a short trans- 
membrane hairpin, the latter may explain better the role of 
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E protein in viral morphogenesis. However, we note that a 
palindromic sequence similar to the one described for E 
protein, and which has been suggested to be essential, is only 
present in the SARS sequence, but not in the rest of the 
homologs we have considered (12), which should presum- 
ably have a similar function. 


CONCLUSION 


We have obtained experimental data that shows that ETM, 
flanked with two lysine residues, incorporated in model lipid 
bilayers adopts an a-helical conformation, with a transmem- 
brane topology, 1.e., C- and N-termini in opposite sides of the 
lipid bilayer. By comparison with previous computational 
studies, our orientational data of synthetic ETM labeled iso- 
topically is only consistent with a homopentameric model. 
This pentameric form could be responsible for the 1on chan- 
nel effects observed in vitro in the full length polypeptide of 
protein E. 


J.T. thanks the financial support of the Biomedical Research Council 
of Singapore. 
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